AITopics | propaganda detection

Collaborating Authors

propaganda detection

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

MARSAD: A Multi-Functional Tool for Real-Time Social Media Analysis

Biswas, Md. Rafiul, Alam, Firoj, Zaghouani, Wajdi

arXiv.org Artificial IntelligenceDec-2-2025

MARSAD is a multifunctional natural language processing (NLP) platform designed for real-time social media monitoring and analysis, with a particular focus on the Arabic-speaking world. It enables researchers and non-technical users alike to examine both live and archived social media content, producing detailed visualizations and reports across various dimensions, including sentiment analysis, emotion analysis, propaganda detection, fact-checking, and hate speech detection. The platform also provides secure data-scraping capabilities through API keys for accessing public social media data. MARSAD's backend architecture integrates flexible document storage with structured data management, ensuring efficient processing of large and multimodal datasets. Its user-friendly frontend supports seamless data upload and interaction.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2512.01369

Genre: Research Report (0.64)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Hybrid Annotation for Propaganda Detection: Integrating LLM Pre-Annotations with Human Intelligence

Sahitaj, Ariana, Sahitaj, Premtim, Solopova, Veronika, Li, Jiaao, Möller, Sebastian, Schmitt, Vera

arXiv.org Artificial IntelligenceJul-25-2025

Propaganda detection on social media remains challenging due to task complexity and limited high-quality labeled data. This paper introduces a novel framework that combines human expertise with Large Language Model (LLM) assistance to improve both annotation consistency and scalability. We propose a hierarchical taxonomy that organizes 14 fine-grained propaganda techniques into three broader categories, conduct a human annotation study on the HQP dataset that reveals low inter-annotator agreement for fine-grained labels, and implement an LLM-assisted pre-annotation pipeline that extracts propagandistic spans, generates concise explanations, and assigns local labels as well as a global label. A secondary human verification study shows significant improvements in both agreement and time-efficiency. Building on this, we fine-tune smaller language models (SLMs) to perform structured annotation. Instead of fine-tuning on human annotations, we train on high-quality LLM-generated data, allowing a large model to produce these annotations and a smaller model to learn to generate them via knowledge distillation. Our work contributes towards the development of scalable and robust propaganda detection systems, supporting the idea of transparent and accountable media ecosystems in line with SDG 16. The code is publicly available at our GitHub repository.

large language model, machine learning, propaganda, (19 more...)

arXiv.org Artificial Intelligence

2507.18343

Country:

Europe (1.00)
Asia > Russia (0.29)

Genre: Research Report > New Finding (0.93)

Industry:

Government > Regional Government (0.47)
Media > News (0.47)
Government > Military (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Reasoning About Persuasion: Can LLMs Enable Explainable Propaganda Detection?

Hasanain, Maram, Hasan, Md Arid, Kmainasi, Mohamed Bayan, Sartori, Elisa, Shahroor, Ali Ezzat, Martino, Giovanni Da San, Alam, Firoj

arXiv.org Artificial IntelligenceFeb-23-2025

There has been significant research on propagandistic content detection across different modalities and languages. However, most studies have primarily focused on detection, with little attention given to explanations justifying the predicted label. This is largely due to the lack of resources that provide explanations alongside annotated labels. To address this issue, we propose a multilingual (i.e., Arabic and English) explanation-enhanced dataset, the first of its kind. Additionally, we introduce an explanation-enhanced LLM for both label detection and rationale-based explanation generation. Our findings indicate that the model performs comparably while also generating explanations. We will make the dataset and experimental resources publicly available for the research community.

dataset, detection, explanation, (14 more...)

arXiv.org Artificial Intelligence

2502.1655

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Middle East > Qatar (0.05)
North America > Canada > Ontario > Toronto (0.04)
(6 more...)

Genre: Research Report > New Finding (0.67)

Industry: Media > News (0.94)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.73)

Add feedback

MultiProSE: A Multi-label Arabic Dataset for Propaganda, Sentiment, and Emotion Detection

Al-Henaki, Lubna, Al-Khalifa, Hend, Al-Salman, Abdulmalik, Alqubayshi, Hajar, Al-Twailay, Hind, Alghamdi, Gheeda, Aljasim, Hawra

arXiv.org Artificial IntelligenceFeb-12-2025

Propaganda is a form of persuasion that has been used throughout history with the intention goal of influencing people's opinions through rhetorical and psychological persuasion techniques for determined ends. Although Arabic ranked as the fourth most- used language on the internet, resources for propaganda detection in languages other than English, especially Arabic, remain extremely limited. To address this gap, the first Arabic dataset for Multi-label Propaganda, Sentiment, and Emotion (MultiProSE) has been introduced. MultiProSE is an open-source extension of the existing Arabic propaganda dataset, ArPro, with the addition of sentiment and emotion annotations for each text. This dataset comprises 8,000 annotated news articles, which is the largest propaganda dataset to date. For each task, several baselines have been developed using large language models (LLMs), such as GPT-4o-mini, and pre-trained language models (PLMs), including three BERT-based models. The dataset, annotation guidelines, and source code are all publicly released to facilitate future research and development in Arabic language models and contribute to a deeper understanding of how various opinion dimensions interact in news media1.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2502.08319

Country:

Europe > Bulgaria (0.04)
Asia > Middle East > Saudi Arabia > Riyadh Province > Riyadh (0.04)
Asia > Middle East > Qatar (0.04)
(14 more...)

Genre: Research Report (0.83)

Industry:

Government (1.00)
Media > News (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

MemeMind at ArAIEval Shared Task: Spotting Persuasive Spans in Arabic Text with Persuasion Techniques Identification

Biswas, Md Rafiul, Shah, Zubair, Zaghouani, Wajdi

arXiv.org Artificial IntelligenceAug-8-2024

This paper focuses on detecting propagandistic spans and persuasion techniques in Arabic text from tweets and news paragraphs. Each entry in the dataset contains a text sample and corresponding labels that indicate the start and end positions of propaganda techniques within the text. Tokens falling within a labeled span were assigned "B" (Begin) or "I" (Inside), "O", corresponding to the specific propaganda technique. Using attention masks, we created uniform lengths for each span and assigned BIO tags to each token based on the provided labels. Then, we used AraBERT-base pre-trained model for Arabic text tokenization and embeddings with a token classification layer to identify propaganda techniques. Our training process involves a two-phase fine-tuning approach. First, we train only the classification layer for a few epochs, followed by full model fine-tuning, updating all parameters. This methodology allows the model to adapt to the specific characteristics of the propaganda detection task while leveraging the knowledge captured by the pre-trained AraBERT model. Our approach achieved an F1 score of 0.2774, securing the 3rd position in the leaderboard of Task 1.

detection, propaganda detection, propaganda technique, (14 more...)

arXiv.org Artificial Intelligence

2408.0454

Country:

Asia > Middle East > Qatar > Ad-Dawhah > Doha (0.04)
Europe > Italy > Tuscany > Pisa Province > Pisa (0.04)
Europe > Italy > Piedmont > Turin Province > Turin (0.04)
(2 more...)

Genre: Research Report (0.82)

Industry:

Media (1.00)
Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Think Fast, Think Slow, Think Critical: Designing an Automated Propaganda Detection Tool

Zavolokina, Liudmila, Sprenkamp, Kilian, Katashinskaya, Zoya, Jones, Daniel Gordon, Schwabe, Gerhard

arXiv.org Artificial IntelligenceFeb-29-2024

In today's digital age, characterized by rapid news consumption and increasing vulnerability to propaganda, fostering citizens' critical thinking is crucial for stable democracies. This paper introduces the design of ClarifAI, a novel automated propaganda detection tool designed to nudge readers towards more critical news consumption by activating the analytical mode of thinking, following Kahneman's dual-system theory of cognition. Using Large Language Models, ClarifAI detects propaganda in news articles and provides context-rich explanations, enhancing users' understanding and critical thinking. Our contribution is threefold: first, we propose the design of ClarifAI; second, in an online experiment, we demonstrate that this design effectively encourages news readers to engage in more critical reading; and third, we emphasize the value of explanations for fostering critical thinking. The study thus offers both a practical tool and useful design knowledge for mitigating propaganda in digital news.

detection, explanation, propaganda, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3613904.3642805

2402.19135

Country:

Asia > Russia (0.14)
North America > United States > New York > New York County > New York City (0.06)
North America > United States > Hawaii > Honolulu County > Honolulu (0.06)
(13 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (1.00)

Industry:

Media > News (1.00)
Education > Educational Setting (1.00)
Health & Medicine > Therapeutic Area (0.68)
Government > Regional Government > North America Government > United States Government (0.45)

Technology:

Information Technology > Human Computer Interaction (1.00)
Information Technology > Communications > Social Media (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Paying Attention to Deflections: Mining Pragmatic Nuances for Whataboutism Detection in Online Discourse

Phi, Khiem, Faramarzi, Noushin Salek, Wang, Chenlu, Banerjee, Ritwik

arXiv.org Artificial IntelligenceFeb-15-2024

Whataboutism, a potent tool for disrupting narratives and sowing distrust, remains under-explored in quantitative NLP research. Moreover, past work has not distinguished its use as a strategy for misinformation and propaganda from its use as a tool for pragmatic and semantic framing. We introduce new datasets from Twitter and YouTube, revealing overlaps as well as distinctions between whataboutism, propaganda, and the tu quoque fallacy. Furthermore, drawing on recent work in linguistic semantics, we differentiate the `what about' lexical construct from whataboutism. Our experiments bring to light unique challenges in its accurate detection, prompting the introduction of a novel method using attention weights for negative sample mining. We report significant improvements of 4% and 10% over previous state-of-the-art methods in our Twitter and YouTube collections, respectively.

dataset, propaganda, whataboutism, (15 more...)

arXiv.org Artificial Intelligence

2402.09934

Country:

Asia > Russia (0.04)
Asia > China (0.04)
North America > United States > Virginia (0.04)
(9 more...)

Genre: Research Report > Promising Solution (0.68)

Industry:

Media > News (1.00)
Information Technology (1.00)
Government (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.69)

Add feedback

Exposing propaganda: an analysis of stylistic cues comparing human annotations and machine classification

Faye, Géraud, Icard, Benjamin, Casanova, Morgane, Chanson, Julien, Maine, François, Bancilhon, François, Gadek, Guillaume, Gravier, Guillaume, Égré, Paul

arXiv.org Artificial IntelligenceFeb-7-2024

This paper investigates the language of propaganda and its stylistic features. It presents the PPN dataset, standing for Propagandist Pseudo-News, a multisource, multilingual, multimodal dataset composed of news articles extracted from websites identified as propaganda sources by expert agencies. A limited sample from this set was randomly mixed with papers from the regular French press, and their URL masked, to conduct an annotation-experiment by humans, using 11 distinct labels. The results show that human annotators were able to reliably discriminate between the two types of press across each of the labels. We propose different NLP techniques to identify the cues used by the annotators, and to compare them with machine classification. They include the analyzer VAGO to measure discourse vagueness and subjectivity, a TF-IDF to serve as a baseline, and four different classifiers: two RoBERTa-based models, CATS using syntax, and one XGBoost combining syntactic and semantic features.

corpus, dataset, propaganda, (16 more...)

arXiv.org Artificial Intelligence

2402.0378

Country:

Asia > Russia (0.68)
Europe > France (0.15)
Europe > Ukraine > Kyiv Oblast > Kyiv (0.14)
(6 more...)

Genre: Research Report > New Finding (0.34)

Industry:

Media > News (0.95)
Government > Regional Government > Europe Government > Russia Government (0.46)
Government > Regional Government > Asia Government > Russia Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.89)

Add feedback

Large Language Models for Propaganda Detection

Sprenkamp, Kilian, Jones, Daniel Gordon, Zavolokina, Liudmila

arXiv.org Artificial IntelligenceNov-27-2023

The prevalence of propaganda in our digital society poses a challenge to societal harmony and the dissemination of truth. Detecting propaganda through NLP in text is challenging due to subtle manipulation techniques and contextual dependencies. To address this issue, we investigate the effectiveness of modern Large Language Models (LLMs) such as GPT-3 and GPT-4 for propaganda detection. We conduct experiments using the SemEval-2020 task 11 dataset, which features news articles labeled with 14 propaganda techniques as a multi-label classification problem. Five variations of GPT-3 and GPT-4 are employed, incorporating various prompt engineering and fine-tuning strategies across the different models. We evaluate the models' performance by assessing metrics such as $F1$ score, $Precision$, and $Recall$, comparing the results with the current state-of-the-art approach using RoBERTa. Our findings demonstrate that GPT-4 achieves comparable results to the current state-of-the-art. Further, this study analyzes the potential and challenges of LLMs in complex tasks like propaganda detection.

detection, propaganda detection, propaganda technique, (13 more...)

arXiv.org Artificial Intelligence

2310.06422

Country:

Europe > Switzerland > Zürich > Zürich (0.05)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.05)
Europe > Germany (0.04)
Europe > France (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Media (1.00)
Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

HQP: A Human-Annotated Dataset for Detecting Online Propaganda

Maarouf, Abdurahman, Bär, Dominik, Geissler, Dominique, Feuerriegel, Stefan

arXiv.org Artificial IntelligenceMay-1-2023

Online propaganda poses a severe threat to the integrity of societies. However, existing datasets for detecting online propaganda have a key limitation: they were annotated using weak labels that can be noisy and even incorrect. To address this limitation, our work makes the following contributions: (1) We present HQP: a novel dataset (N=30,000) for detecting online propaganda with high-quality labels. To the best of our knowledge, HQP is the first dataset for detecting online propaganda that was created through human annotation. (2) We show empirically that state-of-the-art language models fail in detecting online propaganda when trained with weak labels (AUC: 64.03). In contrast, state-of-the-art language models can accurately detect online propaganda when trained with our high-quality labels (AUC: 92.25), which is an improvement of ~44%. (3) To address the cost of labeling, we extend our work to few-shot learning. Specifically, we show that prompt-based learning using a small sample of high-quality labels can still achieve a reasonable performance (AUC: 80.27). Finally, we discuss implications for the NLP community to balance the cost and quality of labeling. Crucially, our work highlights the importance of high-quality labels for sensitive NLP tasks such as propaganda detection.

machine learning, natural language, propaganda, (18 more...)

arXiv.org Artificial Intelligence

2304.14931

Country:

Asia > Russia (1.00)
Europe > Russia (0.06)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
(5 more...)

Genre: Research Report (0.82)

Industry:

Government > Military (1.00)
Government > Regional Government > Europe Government > Russia Government (0.94)
Government > Regional Government > Asia Government > Russia Government (0.94)
Media > News (0.68)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback